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ABSTRACT 

We introduce a new method for constraining the redshift distribution of a set of galaxies, using weak 
gravitational lensing shear. Instead of using observed shears and redshifts to constrain cosmological 
parameters, we ask how well the shears around clusters can constrain the redshifts, assuming fixed 
cosmological parameters. This provides a check on photometric redshifts, independent of source spec- 
tral energy distribution properties and therefore free of confounding factors such as misidentification 
of spectral breaks. We find that ~ 40 massive {a^ — 1200 km s~^) cluster lenses are sufficient to de- 
termine the fraction of sources in each of six coarse redshift bins to ^11%, given weak (20%) priors on 
the masses of the highest-redshift lenses, tight (5%) priors on the masses of the lowest-redshift lenses, 

and only modest (20-50%) priors on calibration and evolution effects. Additional massive lenses drive 
_ 1 

down uncertainties as N^^^^^ but the improvement slows as one is forced to use lenses further down the 
mass function. Future large surveys contain enough clusters to reach 1% precision in the bin fractions 
if the tight lens mass priors can be maintained for large samples of lenses. In practice this will be 
difficult to achieve, but the method may be valuable as a complement to other more precise methods 
because it is based on different physics and therefore has different systematic errors. 
Subject headings: surveys — gravitational lensing:weak — methods: statistical 



1. INTRODUCTION 

Photometric redshifts are of key importance to cur- 
rent and future galaxy surveys. In a typical application, 
galaxies might be binned according to photometric red- 
shift and then some analysis conducted on these binned 
source sets. Avoiding systematic error in this case re- 
quires knowledge of the true redshift distribution of each 
photometric redshift bin. Ideally, photometric redshift 
methods themselves produce reliable confidence intervals 
for this purpose, but external verification is important for 
controlling and quantifying systematic errors. For deep 
surveys, direct verification with a spectroscopic sample 
representative of the photometric sample is very difficult, 
and this has led to the development of other methods 
such as cross-correlating each photometric redshift bin 
with other photometric redshift bins (Schneider et al. 
2006, Erben et al. 2009) or with spectroscopic redshift 
bins (Newman 2008, Matthews & Newman 2010). Here 
we present a new and independent method for recon- 
structing source redshift distributions, using the shear 
from weak gravitational lensing. 

The shear induced by a given lens grows with source 
redshift in a well-understoood way which depends on 
cosmographic parameters. Other authors (Jain & Tay- 
lor 2003, Bernstein & Jain 2004) have suggested using 
this dependence to constrain the cosmographic parame- 
ters from the observed shear-vs-redshift relation around 
identified lenses. (Note that this is distinct from cosmic 
shear, for which the redshift dependence also involves the 
growth of structure.) In this paper we reverse the ques- 
tion and ask how well the redshift distribution of a set 
of galaxies can be constrained by shear measurements 
around lenses at a range of redshifts, if the background 
cosmology is already well determined (see Appendix [A| 
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for a demonstration that cosmological uncertainties are 
negligible here). This may prove useful for applications 
which take the background cosmology as given, and it 
also provides an internal consistency check for surveys. 

Although the shear around any given lens is rather 
weak and might be expected to provide very little con- 
straint, a very large survey such as LSST (Ivezic et al. 
2008) will contain billions of sources lensed by tens of 
thousands of galaxy clusters. Assuming the photometry 
is uniform over the sky, the entire dataset can be used 
to constrain the source redshift distribution. In this pa- 
per we use Fisher matrix formalism to estimate an upper 
bound on the effectiveness of this method, and we discuss 
challenges to implementing the method on real data. Be- 
cause this method requires a very large survey, we refer 
to LSST throughout. We refer the reader to the LSST 
Science Book (LSST Science Collaborations et al. 2009) 
for more details on LSST survey parameters. 

2. BASIC IDEA AND TOY MODEL 

The tangential shear 7 at a projected distance r around 
an axisymmetric lens is: 

7(0 = ^%^P«r)-E(.)] (1) 

where Dig, Di, and Dg are respectively the angular di- 
ameter distances from lens to source, observer to lens, 
and observer to source, S(r) is the projected surface 
mass density at r, and E(< r) is the mean projected 
surface mass density within r (Miralda-Escude 1991). 
Figure [T| (top panel) shows the behavior of the distance 
factor as a function of source redshift, for vari- 

ous fixed lens redshifts. This depends on the assumed 
cosmological model; we assume a WMAP7 ACDM cos- 
mology of Ha = 70.4 km s"! Mpc -\ f]„ = 0.272, 
and f^A = 0.728 throughout. It is clear that very low- 



2 



1000 



800 



200 









^' / 

- // / 


>/ 'o/ =0/ 
C57 0/ 


7 









8.0 0.5 1.0 1.5 2.0 2.5 3.0 



source redshift 



0.9 




source redshift 



Fig. 1. — Top: The distance factor — ^- as a function of source 

redshift, for various fixed lens redshifts. The lens redshift for each 
curve can be read off at the point where the curve departs from 
zero. Bottom: as for the left panel, but plotting /3 = . 

redshift [z < 0.2) source galaxies will have no measurable 
shear around any of the lenses in the figure, moderate- 
redshift (~ 0.5) sources will have measurable shear only 
around the lower-redshift lenses, and high-redshift (z > 
1) sources will have measurable shear around all lenses. 
This is the conceptual basis for constraining the redshift 
distribution of a source set: one should be able to solve 
for the source redshift distribution which, when passed 
through the appropriate response curves, best matches 
the observed shears. The method requires lenses at a 
range of redshifts and, to obtain good constraints, some 
prior knowledge of the lens masses. To simplify the ex- 
position we treat the lens propertise as known in this 
section, and explore requirements on prior knowledge of 
lens properties in the next section. 

Individual galaxies as well as clusters of galaxies can 
serve as lenses. Typically, the shear one arcminute off 
axis might be of order 0.5 for clusters and 0.01 for galax- 
ies. Galaxies have near-isothermal mass profiles (Gavazzi 
et al. 2007), in which E(r) and S(< r) scale as r~^; 



specifically, !](< r) — I](r) = where o-„ is the ve- 
locity dispersion. Because this scales as r~^, it scales as 
(ODi)^^ in terms of projected angular distance 0, and 
7(6*) can be written 

Us 

Isothermal lenses thus have the convenient property that 
7(0) can be separated into a part which depends only on 
the cosmology and a part which depends only on the mass 
or velocity dispersion of the lens. Mass profiles which 
are not scale-free cannot be written this way because the 
conversion from physical to angular scales invokes the 
cosmological model through Di. This convenience is not 
required for our argument, but will substantially simplify 
the exposition here. Although cluster mass profiles de- 
part from isothermality more than galaxy profiles do, for 
this section we will assume isothermality to simplify our 
argument. 

We can now write a very simple model for the shear of 
the ith source galaxy due to the jth lens. We henceforth 
refer to as s to eliminate confusion with the symbol 
a which we will use later to indicate uncertainties on 
measurements. We also define /? = jj^ (taking (3 to be 

zero where Zs < zi); the /3 curves for various lenses are 
shown in the bottom panel of Figure [l] We can then 
write 

^^o-if^^o^r (2) 

where 0ij is the angular separation between lens j and 
source i, Pij is the distance ratio between the two, and 
Sj is the velocity dispersion of lens j. 

The redshift distribution of the source set can be pa- 
rameterized in many ways. Here we adopt the simplest 
approach, uniform bins in redshift. With A'bin bins, we 
parametrize the distribution with /i, /2, •■•/AfbinJ where 
fm is the fraction of the galaxies in the set which are ac- 
tually in redshift bin to. The highest-redshift bin should 
extend to arbitrarily high redshift so that none of the 
measured shear can come from an unmodeled part of 
the distributionj^ Consequently, we do not require that 
Hm^ra = 1; any difference between unity and J2m fm 
can be interpreted as due to sources at such low red- 
shift that they are never lensed. (A derived parameter 
/o = 1 — J2m /™ '^^^ tie defined which encodes the frac- 
tion of sources at ultralow redshifts, but is not a param- 
eter we solve for directly.) For sources from an unknown 
redshift distribution, the expected value of P is simply 
J2m fmPmj , where f^mj is now the mean distance ratio 
between lens j and source redshift bin m. Thus the ex- 
pected shear for a lens-source combination is 

(7u) = ^7rE/™/3"^- (3) 

C Uij 

In this section, we will assume that Sj is perfectly known; 
we will consider uncertainty in lens masses in ^|3j 

^ Because /3 becomes fairly flat at high redshift, a very wide 
high-redshift bin is no more problematic than a modest- width low- 
redshift bin in terms of variation of (5 within each bin. 
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The observable is the total sheai0 on each source 
galaxy due to all lenses which, in the weak lensing limit, 
is 7i = Jij- However, we must add shears componen- 
twise because in the multiple-deflector context there is 
no longer a single "tangential" component. Thus shear 
is usually written as a complex quantity: 
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where (pij is the position angle of source i with respect to 
lens j, and the i in the exponent represents ^/—l. The 
total expected shear on the source is then 



(7.) 



27r si 



We could form a Fisher matrix with each two- 
component 7i contributing two observables; the deriva- 
tives of these observables with respect to the model pa- 
rameters fm are very straightforward. However, this is 
computationally inefhcient because most sources will be 
far from any modeled lens (assuming we use massive clus- 
ters rather than individual galaxies as the lenses) and 
thus will contribute negligible information to the Fisher 
matrix; and of the remaining sources, most are near only 
one lens so that their contribution can be captured by a 
single (tangential) component. Thus, at the expense of 
very little information, we can efhciently group sources 
according to their relevant lenses. 

To do this, we group the observables in Equation |3] on 
one side and the model parameters on the other: 
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The e'^'^'J factor no longer appears because with only one 
lens involved the tangential component is unambiguous. 
We can collapse this down to a single observable for each 
lens by taking a mean of sources relevant to that lens: 
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where i € j denotes averaging over all sources projected 
close enough to lens j to add non- negligible information 
to the Fisher matrix. Accordingly, 
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To construct the Fisher matrix elements, we also need 
the variance of Vj . Because there is essentially no uncer- 
tainty on 6ij, the variance of ^ijOij is erf Of j where Ui is 
the uncertainty on the measured shear of galaxy i. Fur- 
thermore, Ui is uncorrelated with position or lens, so we 
can approximate it as a constant and pull it out of 
any sums. Weighting sources by inverse variance when 

^ Technically, the observable is the reduced shear where k 
is the convergence, but the argument still holds. The effect of re is 
to change the profile somewhat, which is not important for the toy 
model here. 



computing F, , we then find that 
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This sum primarily involves properties of the survey 
rather than the lens. We can rewrite it as the integral of 
the areal density of the source set, n, over the "footprint" 
of the lens, that is, the area over which the lens con- 
tributes non- negligible Fisher information: n J 9~^dA. 
(Note that n here is in units of galaxies per steradian, 
because 6 is in units of radians. We will convert to the 
more typically quoted units of arcmin^^ at the end of 
the calculation.) We shall see that the value of this inte- 
gral is only weakly related to the properties of the lens. 
Integrating from some ^min to some ^max yields 



e-'^dA = 2'Kn 
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(8) 



(9) 



Because the dependence is only logarithmic, we can make 
a rough estimate of for a typical lens and apply it 
to all lenses without much loss of precision. Sources less 
than ~ 1' from the lens center are typically not used 
in weak lensing analyses because of the risk of contami- 
nation by cluster members and/or errors on their shear 
measurements due to background gradients from clus- 
ter members, so we adopt this value of 6'min- At large 
enough 9, shear from unrelated structures will dominate 
the shear from the cluster. Although information about 
the cluster is still there, as a practical matter the effort 
to recover it may not be justified given the logarithmic 
dependence on ^max- The shear around a massive clus- 
ter can clearly be observed out to at least 15' from the 
center (e.g, Kling et al. 2005), so ^max — 15' should be 
a conservative value. Thus Equation [9] is approximately 
equal to 17n, and only weakly dependent on the adopted 
values of ^min and ^max- We expect the dependence on 
assumed shear profile to be weak as well, given that many 
ground-based weak-lensing cluster studies have difficulty 
discriminating between different forms of shear profiles 
(Wittman 2002) . Inserting this result into Equation [jj 
we find that 

17n 

Assuming a Gaussian likelihood model, we now have 
an iVbin by A^bin Fisher matrix: 
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where we have adopted a-y — 0.2, which represents the 
irreducible "shape noise" due to the random intrinsic ori- 
entations of sources. Smaller, fainter, sources will have 
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larger shear uncertainties due to measurement uncertain- 
ties: erf = (T^ + cr^ioas i ■ We account for this by defining 
n as the effective source density, that is, the equivalent 
number of perfectly measured sources provided by the 
source set: n — a?,Y] , , \ where the sum is taken 

over a unit area of sky. This is the basis on which surveys 
or source sets should be quoted; see the explanation of 
Equation 14.7 in the LSST Science Book (LSST Science 
Collaborations et al. 2009) for a more detailed justifica- 
tion. 

As a rough estimate of the potential of this method, 
we compute a Fisher matrix forecast for an LSST- 
depth [n = 40 arcmin"^) survey covering nineteen 
lenses with = 1200 km at redshifts z = 

{0.05, 0.1, 0.15, 0.95}, which roughly corresponds to 
using the most massive lens in the sky at each redshift. 
We model six redshift bins: five covering < z < 1 
with equal spacing, plus a sixth bin containing all galax- 
ies at z > 1. The forecast uncertainties on are then 
{0.06, 0.07, 0.11, 0.17, 0.20, 0.10}. The same Fisher infor- 
mation can be obtained using a larger number of less- 
massive lenses, but going beyond a small number of well- 
studied lenses introduces a new problem: lens parame- 
ters such as (T„ will not be known precisely for all these 
lenses, and we must examine the effect of marginalizing 
over uncertainties in the lens properties. 

3. MARGINALIZING OVER LENS MASS 

The two lens parameters in this model are redshift 
and mass (or equivalently, velocity dispersion). Spectro- 
scopic redshifts are available in the literature for thou- 
sands of clusters, particularly for the most massive ones 
which contribute the most Fisher information. But with 
large imaging surveys now finding more clusters than can 
be followed up with spectroscopic redshift confirmation, 
photometric redshifts for the lenses must be considered. 
Photometric redshifts are much more precise for clusters 
than for individual galaxies, even beyond the statistical 
effect of averaging many member galaxies. Because clus- 
ter members are preferentially early-type galaxies with 
well-established spectral energy distributions, their red- 
shifts can be established with very high confidence. Fur- 
thermore, the accuracy of cluster photometric redshifts 
can be easily verified by spectroscopy of a representative 
subsample of clusters, whereas representativeness is very 
difficult to achieve in spectroscopic subsamples of indi- 
vidual galaxies. The LSST Science Book (LSST Science 
Collaborations et al 2009) forecasts photometric redshift 
uncertainties for modest (lO^^M©) clusters as a function 
of redshift; their Figure 13.12 shows that the uncertain- 
ties are 0.01 or less for all z < 1.1 after a single visit (and 
for all z < 1.9 after ten years of the survey). This fixes 
the /? factors to within 0.01, which is far more precise 
than the masses will be known a priori. We can there- 
fore safely ignore uncertainties in lens redshifts. In the 
remainder of this section, we focus on marginalizing over 
lens mass uncertainties. 

We now regard the s| as nuisance parameters. We 

choose Sj rather than Sj simply for convenience: s| is 
conceptually convenient because it is linearly propor- 
tional to the shear, and this proportionality makes it 
fall out of the relevant Fisher matrix elements. From 



Equation [5] we have 
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otherwise. 
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We now have a Fisher matrix which is A^bin + Mens el- 
ements square. The lower-right Mens x iVicns block is 



diagonal, because 
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— where j ^ k. These di- 



agonal elements look like the square of the first case in 
Equation [Ts] The upper- left Abin x Abin block contains 
only elements of the form in Equation 11 and the other 
blocks (the Aliens X Abin block in the upper right and its 
transpose in the lower left) contain mixed terms. Note, 
however, that many of the elements in these latter blocks 
will be zero, corresponding to lens/bin combinations for 
which the bin redshift is lower than the lens redshift, 
and thus for which f3mj = 0. In the upper left block, no 
elements will vanish unless all lenses are at a higher red- 
shift than the lowest-redshift bin. Note that the /„ now 
appear in the Fisher matrix, so we must adopt fiducial 
values. Fisher matrix estimates should not be trusted 
far from the fiducial values; see Albrccht et al. (2006) 
for a clear exposition of Fisher matrix limitations. We 
therefore adopt a uniform fiducial distribution for now, 
and examine the dependence on the fiducial distribution 
at the end of this section. 

Next, we need a prior on the lens masses. With no 
prior, the lowest-redshift lens would have no constrain- 
ing power at all, as any shear (or lack thereof) in the 
lowest-redshift bin could be explained as a result of ar- 
bitrarily large (or small) lens mass. Successively higher- 
redshift lenses would have some constraining power, but 
not much. We therefore explore the importance of this 
prior by running Fisher matrix forecasts with the lens set 
of the previous section, but with priors of 1%, 2%, 5%, 
10%, and 20% on the lens masses assuming the fiducial 
Sj = 1200 km s^^. We implement this by adding to the 
Fisher matrix the inverse variance representing a Gaus- 
sian prior before inverting the Fisher matrix to obtain 
the covariance matrix. To implement the 1% prior, for 
example, we add (0.01(1200^))^^ to ^7,7, ^8,8, ■•■•?^25,25, 
which are the nineteen elements representing the lens s'j . 
Figure [2] shows the results. For the low-redshift bins, ob- 
taining constraints close to those of the previous section 
requires a 1-2% prior, but for higher-redshift bins the re- 
quired priors are more modest, 10-20%. Priors tighter 
than 1% (not shown) do not yield any visible improve- 
ment in this figure. 

The most eflficient observational approach may thus be 
to obtain tight priors on low-redshift lenses using veloc- 
ity dispersions. X-ray analyses, external lensing analyses, 
etc, while obtaining only very rough priors on the higher- 
redshift lenses. We model this scenario by testing a mass 
prior which is 5% at zicns — but loosens as (1 -I- zicns)^ 
(i.e., a 20% prior at zions = 1); this is is shown as the 
dashed curve in Figure [2] This yields source redshift con- 
straints as good as those obtained with a 5-10% prior, but 
without much difficult high-redshift work. Indeed, opti- 
cal richness alone can provide a 20-30% prior on the mass 
(Rykoff et al. 2011), thus obviating the need for foUowup 
at the high-redshift, assuming there is no redshift above 
which the optical richness estimator breaks down. 

To a reasonable approximation, each additional lens 
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Fig. 2. — Constraints on the bin fractions for various lens-mass 
priors in the simphfied physical scenario of Sj2]with just one mas- 
sive lens at each redshift. The redshift- varymg prior allows for 
weaker lens-mass priors at higher lens redshifts where priors are 
both less necessary and more difficult to establish observationally; 
this particular form is 5%(1 -I- zions)'^ which brings the prior to 
20% at zicjis = 1- More lenses of the same mass will tighten the 
„ 1 

constraints as A^j^^^^^. 

tightens the constraints mostly on the 1-2 source red- 
shift bins nearest the lens. Therefore, we can tweak the 
lens sample to obtain roughly equal constraints on all 
source redshift bins. For example, we see from Figure [2] 
that the maximum uncertainty for the efficient redshift- 
varying mass prior is for source redshifts 0.7-0.9, at a 
level of roughly twice the minimum uncertainty. Tripling 
the number of lenses at redshifts 0.5-0.95, for a total of 
39 massive lenses, yields roughly equal constraints on 
most source redshift bins, averaging 0.11 (middle curve 
in Figure |3]). Reaching yet lower levels of uncertainty 
requires more lenses overall. For example, reaching 0.01 
uncertainty across the board will require 11^ times more 
lenses than the previous scenario, or about 5000 total. 
However, when so many lenses are considered, we can- 
not use only very massive lenses. Therefore, the required 
number of lenses will scale much more steeply than the 
inverse square of the desired constraint. The Fisher in- 
formation corresponding to 5000 lenses with = 1200 
km s^^ could be accumulated with tens of thousands of 
less-massive lenses, which is well within expectations for 
current and future large imaging surveys. Computation- 
ally, using so many lenses should be feasible because most 
of the tens of thousands of nuisance parameters can be 
marginalized over separately, reflecting the nonoverlap- 
ping nature of most lenses. However, the 1% goal will be 
difficult to achieve at low redshift due to the difficulty of 
obtaining the 5-8% lens-mass priors necessary to do so 
well there. 

These factors provide insight into why we do not pro- 
pose using individual galaxies as lenses. For every 10^^ 
Mq cluster, a survey would need one milhon 10^^ Mq 
galaxies to provide the same Fisher information, all other 
factors being equal. Although surveys such as LSST will 
have billions of galaxies, all other factors are not equal. 
The photometric redshift uncertainty on each individual 
galaxy is much larger than for clusters, and individual- 
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Fig. 3. — Constraints on the bin fractions for low-redshift (lower 
curve), flat (middle curve), and high-redshift (upper curve) source 
sets. Each source set is used with the redshift- varying lens-mass 
prior and 39 lenses described in the text. 

galaxy lens photometric redshifts bring in exactly the 
problems this method is designed to avoid. These prob- 
lems could be avoided by using only early-type galaxies 
with well-behaved photometric redshifts and mass priors 
from the fundamental plane, but then far too few lenses 
would be available. Furthermore, galaxy-scale lenses do 
not dominate their local shear in a way which allows par- 
allelized modeling of designated patches of sky, and thus 
would introduce additional computational challenges. 

A foreseeable application for this method is to input 
source sets corresponding to each of several photometric 
redshift bins, to determine the outlier fraction and distri- 
bution in each. Therefore, we examine the dependence 
on the redshift distribution of the source set, which we 
have assumed to be flat until now. Figure |3] shows the 
results using the redshift-varying mass prior and the 39 
massive lenses with three different source sets: flat (as 
described above, middle curve), all in the 2 < 0.2 bin 
(lower curve), and all in the 0.8 < z < 1.0 bin (upper 
curve) . Characterization of outliers will be more precise 
for the low-Zphot source set than for the high-Zphot source 
set, although this is partly due to the choice of lens-mass 
prior which is tighter at lower redshift. 

4. PROFILE AND CALIBRATION UNCERTAINTIES 

To this point we have assumed that prior knowledge of 
a cluster's velocity dispersiorj^ translates perfectly into 
prior knowledge of its integrated shear statistic, given a 
source redshift distribution. However, the relationship 
between these two observables is sure to be more com- 
plicated. Clusters with the same velocity dispersion may 
have different profiles, resulting in different integrated 
shear estimates. If this is a purely stochastic effect, then 
the scatter in the observable-observable relation weakens 
the prior. But in principle, the loss of information due to 
this scatter can be compensated by analyzing more clus- 
ters. Conceptually more worrisome would be a redshift- 

Throughout this section we refer to velocity dispersion for con- 
creteness, hvit Sunyaev-Zel'dovich effect, X-ray, or optical richness 
measurements could equally well be used to set the prior. 
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Fig. 4. — Constraints resulting from a range of calibration (top 
panel) and evolution (bottom panel) priors. For the top panel the 
evolution prior has been set to 20% in all cases, and for the bottom 
panel the calibration prior has been set to 20% in all cases. Priors 
tighter than 20% on either do not substantially improve the results: 
the bottom curve in each panel is very similar to the middle curve 
in Figure |3] where the priors were delta functions. 

dependent trend or an overall calibration error in this 
relationship, which would cause systematic errors. 

We therefore insert nuisance parameters describing the 
uncertainties in calibration and redshift evolution, and 
explore how the results degrade as priors on these pa- 
rameters are relaxed. We allow Equation [5] to be multi- 
plied by a factor a + bzj , where a is a nuisance parameter 
for the calibration, 6 is a nuisance parameter for the red- 
shift evolution, and Zj is the redshift of the jth lens, and 
repeat the analysis of the previous section with the flat 
source distribution and the redshift-dependent lens mass 
prior. Figure [4] shows the reconstruction precision for a 
range of calibration and evolution priors, with a fiducial 
model of a = 1 and 5 = 0. Priors of 20% on each are suf- 
ficient to preserve performance nearly identical to that of 
the previous section in which calibration and evolution 
were not factors; the bottom curve in each panel is very 
similar to the middle curve in Figure [Sj 

We stress that the choice of an isotnermal profile as a 



calculation aid here is not fundamental to the method. In 
practice, surveys would measure the average cluster pro- 
file and construct an optimal integrated shear estimator 
before commencing the redshift analysis. The results of 
this section indicate that profile calibration and evolution 
uncertainties at the 20-50% level do not substantially de- 
grade the performance of this method. 

5. DISCUSSION 

We have outlined a new method which uses gravita- 
tional lensing by clusters and knowledge of the cosmology 
to constrain source redshift distributions. Although this 
is an unorthodox use of the lensing-redshift information, 
it has some strengths and some relevant applications. 
The method's strengths are: 

• it is independent of other methods. It relies on 
no assumptions about galaxy spectral properties 
or biasing. 

• it is applicable to any set of galaxies. One can de- 
termine the redshift distribution of a set of galaxies 
chosen in any number of ways, such as photometric 
redshift cut, pure color selection, or even X-ray or 
radio selection. Multiwavelength observations are 
not necessary as they are for photometric redshifts. 

• in the context of a large imaging survey, it requires 
little additional data. Where lens spectroscopic 
redshifts are not available, photometric lens red- 
shifts will do, and optical richness provides a suf- 
ficient lens-mass prior for the high-redshift lenses. 
However, pinning down the low-redshift end of the 
source distribution will require strong mass priors 
on low-redshift lenses. 

Our Fisher matrix forecast involves some approxima- 
tions and caveats: 

• Lens morphology: we have assumed axisymmetric 
isothermal lenses. Although we have argued that 
other profiles would not yield substantially differ- 
ent forecasts, a precise experiment would include 
some sort of marginalization over profiles. This 
marginalization might include allowances for non- 
axisymmetry. 

• Foreground/background structure: we have as- 
sumed that shear from foreground/background 
structure is negligible in the "footprint" of each 
lens, and totally dominant outside. In reality, un- 
modeled shear will be present in the footprint as 
well, and contribute to slightly looser constraints. 
However, unmodeled shear is not an irreducible 
source of noise. One can choose to model pre- 
viously unmodeled structure and thus add infor- 
mation to the Fisher matrix. This entails a cost- 
benefit analysis rather than a hard limit. 

• Magnification: the observed source population be- 
hind lenses will be slightly different than in other 
locations, due to lensing magnification. If un- 
modeled, this may result in a nontrivial system- 
atic error. However, because the lenses are be- 
ing modeled, the model magnification at any given 
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point provides an easily available basis for correc- 
tion. Deep fields which establish the source counts 
fainter than the main survey flux limit will be 
very useful for this purpose. As with the fore- 
ground/background structure, it may be possible 
to add information to the Fisher matrix with im- 
proved modeling of the magnification. 

• Cluster contamination: the areas around clusters 
differ from the general area of a survey in another 
respect, the presence of cluster members. Care 
must be taken so that the source set whose shear is 
being measured is not contaminated by these mem- 
bers. 

• Cosmology dependence of /3: Appendix |X] shows 
that marginalizing over uncertainty in cosmo- 
graphic parameters will not degrade the results 
presented here. But using the cosmology as input 
here implies that the redshift estimates from this 
method should not feed in to some other method 
which estimates the cosmology unless the covari- 
ances are properly treated. 

• Wide source redshift bins: given a lens redshift, 
P varies over the width of a source redshift bin. 
Accurately modeling this effect will require some 
assumptions about the source redshift distribution 
inside the bins. It may be possible to find a param- 
eterization which reduces this difficulty. 

The source redshift distributions obtained with this 
method will be most useful in applications where the 
background cosmology would have been assumed any- 



way, such as galaxy evolution, and especially in appli- 
cations which are very sensitive to photometric redshift 
outliers. For example, measurements of the bright end of 
the galaxy luminosity function at high photometric red- 
shift are extremely sensitive to catastrophic outliers, as 
a small fraction of low-redshift galaxies mistakenly put 
at high redshift results in many more "high-luminosity" 
galaxies at that redshift. Surveys can use this method to 
independently derive the full redshift distribution of the 
galaxies in each photometric redshift bin, thus charac- 
terizing the number and distribution of outliers. Indeed, 
the use of multiwavelength photometry to infer both 
source redshift and source spectral/photometric proper- 
ties can be problematic, and this method offers a possible 
workaround. 

Other methods such as cross-correlation with a spec- 
troscopic sample (Newman 2008, Matthews & Newman 
2010) should be able to achieve 1% accuracy without 
assuming a cosmology or requiring expensive lens mass 
priors. This method may therefore not be competitive 
statistically, but as it is based on different physics it may 
offer value in having different systematic errors. Redshift 
distributions derived using this method can and should 
be used in consistency checks. Given the best-fit cosmol- 
ogy from weak lensing tomography, this method should 
produce source redshift distributions which are consis- 
tent with those used in the weak lensing tomography. A 
prime application for this method may therefore be as 
an internal consistency check for surveys. 

We thank J. A. Tyson for suggesting that we pursue the 
idea of using lensing to constrain source redshift distribu- 
tions, and the anonymous referee for suggestions which 
led to improvements in the paper. 



APPENDIX 

DEPENDENCE ON COSMOLOGICAL PARAMETERS 

This appendix justifies the assumption in the main body of the paper that uncertainties in the distance ratios 
contribute negligible uncertainty to the parameters of interest, the fraction of sources in each redshift bin. Uncertainty 
in the distance ratios comes from uncertainty in cosmographic parameters, including Hubble's constant Hq and the 
energy densities in matter (llm), curvature (rJfe), and in the cosmological constant (fiA)- Of course, more general 
dark energy models can introduce further parameters, but the cosmological constant has a greater effect on distances 
than a dark energy model with equation of state parameter w > —1. Therefore, if the assumption is justified for a 
cosmological constant model, it is justified for models with w > — 1. 

Distances depend linearly on Hubble's constant iJo, but j3 = jj^ is a ratio of distances. Therefore /3 does not depend 
on Hq and uncertainty in Hq is irrelevant. 

The remaining parameters are more difficult to treat analytically, so we make a numerical estimate. The derivatives 
of (3 with respect to rim and J^a (with flk — 1 — f^m — ^^a) depend on the lens and source redshifts, but are typically 
of order 0.02. The worst case is for uncertainty in Vim when considering high-redshift sources and lenses, where 
« 0.15. We take a conservative approach by setting — 0-15 for all sources and lenses, so that the partial 
derivative of Equation Js] with respect to il„i is 0.15 times |f Sj- We include flm as a parameter in the Fisher matrix 
using these partial derivatives. 

We rerun the forecast with a Gaussian prior of a — 0.016 on flm, which corresponds to the current 
WMAP7+BAO+II0 uncertainty for the "OLCDM" model in which curvature is allowed to vary0 If the results 
were plotted on Figure [2] or Figure [3j they would be nearly indistinguishable from the existing plots. The biggest 
difference would be the point in the lower right corner of Figure [3] increasing from 0.052 to 0.055. The effect of varying 
r^A is even smaller because /3 is less sensitive to it. 
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